System, Method and Apparatus for Preventing Data Loss Due to Memory Defects Using Latches

ABSTRACT

A system and method for operating a memory system includes receiving a first user data, writing the first user data to a first buffer, writing the first user data from the first buffer to a first selected memory location, writing the first user data from the first buffer into a second buffer when the first user data was successfully written to the first selected memory location. Data is retrieved from the first selected memory location and written into the first buffer. Data in the first buffer can be matched to the user data in the second buffer to confirm a successful storage of the first user data in the memory system. A previously stored user data can be retrieved from a third selected memory location and written into a third buffer when the previously stored user data was stored in the memory system before the first user data.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of and claims priority from U.S.patent application Ser. No. 14/300,155 filed on Jun. 9, 2014 andentitled “System, Method and Apparatus for Preventing Data Loss Due toMemory Defects Using Latches,” which is incorporated herein by referencein its entirety.

BACKGROUND

The present invention relates generally to memory systems, and moreparticularly, to methods and systems for using and testing the integrityof memory systems.

Solid state memory systems are manufactured in great volume and in manydifferent forms including volatile type memory circuits and non-volatiletype memory circuits. As part of the production process the completedmemory circuits are tested to confirm the proper operation of thememory. Specifically, the completed memory circuits are tested toconfirm the memory circuit can be written to and the data that waswritten (i.e., stored) in the memory circuit can accurately be read backfrom the memory circuit. Typically, a high voltage write and erasecycles and read operations are applied to the memory circuit. The highervoltage of the operations physically stress the semiconductor devices(e.g., gates, P-N junctions, conductive lines, etc.) greater thantypical operating voltage so as to cause physically weaker semiconductordevices to fail.

There are many types of tests performed as part of the manufacturingprocess. These manufacturing process tests can reliably only identifymemory circuits that fail due to manufacturing defects or fail duringearly operations, often referred to as infant mortality, occurring in anearly portion of the projected service life of the memory circuit.

The memory circuits that pass the manufacturing process tests are thenshipped to end users and distributors. Unfortunately, many memorycircuits can fail later in the projected service life, well after thememory circuits successfully passed the manufacturing process tests.When memory circuits fail, the data stored therein can be lost,corrupted or otherwise rendered not accessible and effectively lost.

In view of the foregoing, there is a need for system and method forpreventing data loss due to memory cell failure.

SUMMARY

Broadly speaking, the present invention fills these needs by providingsystems and methods for preventing data loss due to memory cell failure.It should be appreciated that the present invention can be implementedin numerous ways, including as a process, an apparatus, a system,computer readable media, or a device. Several inventive embodiments ofthe present invention are described below.

One embodiment provides a method for operating a memory system includingreceiving a first quantity of user data (DATAn), writing the firstquantity of user data (DATAn) to a first buffer, writing the firstquantity of user data (DATAn) from the first buffer to a first selectedmemory location (WLn) in a first selected memory block, writing thefirst quantity of user data (DATAn) from the first buffer into a secondbuffer when the first quantity of user data (DATAn) was successfullywritten to the first selected memory location (WLn) of the firstselected memory block. A quantity of stored data (DATAn′) can beretrieved from the first selected memory location and the retrievedquantity of stored data (DATAn′) can be written into the first buffer.The quantity of stored data in the first buffer (DATAn′) can be matchedto the first quantity of user data (DATAn) in the second buffer toconfirm a successful storage of the first quantity of user data, in thememory system.

The method can also include writing the first quantity of user data(DATAn) from the first buffer into a second selected memory location WLnof a second selected memory block in the memory system when the firstquantity of user data (DATAn) was not successfully written to the firstselected memory location. The method can also include writing a secondquantity of user data (DATAn−1) to a third selected memory location inthe memory system when the second quantity of user data (DATAn−1) isstored in a third buffer of the memory system.

The third selected memory location can be adjacent to the secondselected memory location. The third selected memory location can precedethe second selected memory location in the memory system. The secondselected memory location can be included in a dedicated error handlingmemory block in the memory system. The dedicated error handling memoryblock can be included in the memory array or maintained separately fromthe memory array.

Receiving the first quantity of user data (DATAn) to be stored in thememory system can also include retrieving a quantity of previouslystored user data (DATAn−1) in a third selected memory location andwriting the retrieved quantity of previously stored user data (DATAn−1)into the third buffer when the previously stored user data (DATAn−1) wasstored in the memory system before the first quantity of user data(DATAn). Writing the retrieved quantity of previously stored user data(DATAn−1) into the third buffer can include writing the retrievedquantity of previously stored user data (DATAn−1) into the first buffer,before writing the first quantity of user data (DATAn) to the firstbuffer and writing the retrieved quantity of previously stored user data(DATAn−1) from the first buffer into the third buffer.

The method can also include writing the first quantity of user data(DATAn) from the second buffer into the second selected memory locationin the memory system when the quantity of stored data (DATAn′) in thefirst buffer does not match the first quantity of user data (DATAn) inthe second buffer. A second quantity of user data (DATAn−1) can bewritten to the third selected memory location in the memory system whenthe second quantity of user data (DATAn−1) is stored in a third bufferof the memory system.

The method can also include reporting a detected memory failure.Reporting the detected memory failure can include identifying the memorylocation of the memory failure.

Another embodiment provides a method for operating a memory systemincluding writing a second quantity of user data (DATAn−1) into a thirdbuffer of the memory system, receiving a first quantity of user data(DATAn) to be stored, the second quantity of user data (DATAn−1)preceding the first quantity of user data (DATAn). The first quantity ofuser data (DATAn) is written to a first buffer. The first quantity ofuser data (DATAn) is also written from the first buffer to a firstselected memory location (WLn) in a first selected memory block. Thefirst quantity of user data (DATAn) is written from the first bufferinto a second buffer of the memory system when the first quantity ofuser data (DATAn) was successfully written to the first selected memorylocation. A quantity of stored data (DATAn′) in the first selectedmemory location is retrieved and written into the first buffer. Thequantity of stored data (DATAn′) in the first buffer is confirmed tomatch the first quantity of user data (DATAn) in the second buffer. Whenthe first quantity of user data (DATAn) was not successfully written tothe first selected memory location, the first quantity of user data(DATAn) is written from the first buffer into a second selected memorylocation (WLn) in a second selected memory block in the memory systemand the second quantity of user data is written to a third selectedmemory location (WLn−1) in the second selected memory block in thememory system, wherein the second selected memory location and the thirdmemory location are included in a separate memory block from the firstselected memory location.

Another embodiment provides a memory system including a memory array ofmultiple memory blocks of memory storage locations, a memory controllercoupled to the memory array and a memory operating system logic. Thememory operating system logic includes computer readable instructions oncomputer readable media for receiving a first quantity of user data(DATAn), writing the first quantity of user data (DATAn) to a firstbuffer and also writing the first quantity of user data (DATAn) from thefirst buffer to a first selected memory location (WLn) of a firstselected memory block, writing the first quantity of user data (DATAn)from the first buffer into a second buffer when the first quantity ofuser data (DATAn) was successfully written to the first selected memorylocation (WLn), retrieving a quantity of stored data (DATAn′) in thefirst selected memory location and writing the retrieved quantity ofstored data (DATAn′) into the first buffer. Confirming the quantity ofstored data (DATAn′) in the first buffer matches the first quantity ofuser data (DATAn) in the second buffer.

Other aspects and advantages of the invention will become apparent fromthe following detailed description, taken in conjunction with theaccompanying drawings, illustrating by way of example the principles ofthe invention.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will be readily understood by the followingdetailed description in conjunction with the accompanying drawings.

FIG. 1 is a block diagram of a memory system, for implementingembodiments of the present disclosure.

FIG. 2 is a simplified block diagram of a memory block, for implementingembodiments of the present disclosure.

FIG. 3 is a flowchart diagram that illustrates the method operationsperformed in for preventing data loss due to memory cell failure in aselected memory block, for implementing embodiments of the presentdisclosure.

FIG. 4A illustrates the movement of DATAn−1 in operations 315 and 320,for implementing embodiments of the present disclosure.

FIG. 4B illustrates the movement of DATAn in operations 330 and 335, forimplementing embodiments of the present disclosure.

FIG. 5A illustrates the movement of DATAn in operation 345 and DATAn−1in operation 350, for implementing embodiments of the presentdisclosure.

FIG. 5B illustrates the movement of DATAn in operation 360 and DATAn′ inoperation 365, for implementing embodiments of the present disclosure.

FIG. 6 illustrates the movement of DATAn in operation 380 and DATAn−1 inoperation 385, for implementing embodiments of the present disclosure.

FIG. 7 is a block diagram of an example computer system, forimplementing embodiments of the present disclosure.

DETAILED DESCRIPTION

Several exemplary embodiments for systems and methods for preventingdata loss due to memory cell failure will now be described. It will beapparent to those skilled in the art that the present invention may bepracticed without some or all of the specific details set forth herein.

Memory circuit defects such as a broken (i.e., open) wordline(s) andweak or shorted control gate can corrupt the data during the writeprocess without providing a timely fail status. Other defects such aswordline to wordline shorts can also corrupt data from the precedingwordline(s). When the host attempts to read wordlines with such defects,an uncorrectable error correction code (UECC error) is often received.Unfortunately, in many instances the data may not be recoverable by thetime the UECC error is received by the host. As a result, data can belost.

One solution includes storing data for wordline(s) n (currently writtendata) and for wordline(s) n−1 in latches or buffers. The data forwordline(s) n and for wordline(s) n−1 can be recovered if a UECC erroror other defect is encountered. Before current data wordline (WLn) iswritten to a memory block, the data from previous wordline (WLn−1) isread into a buffer or other latch circuit such as a transfer latch andthen transferred to an unused buffer or latch circuit. The current datafor WLn is written to the memory block and also transferred to anotherunused buffer or latch circuit. At the end of the write operation, thedata is located as follows:

WLn—Written to memory block

WLn—Copy available in first buffer or latch

WLn−1—Copy available in second buffer or latch

WLn is read from the memory block for an error check such as a check bitor other error detection process. If the error check passes, the writeis finalized and next command is processed. If the error check isfailed, the good data is still available in the first and secondbuffers/latches for WLn and WLn−1, respectively, and can be recovered bycopying the data from the first and second buffers/latches to a newmemory block. A dedicated error handling memory block can be referred toas a safe zone memory block and can be set aside for handling memoryfailures. The dedicated error handling memory block can be included inthe memory array or maintained separate from the memory array. The aboveprocess and system allows the memory system to be more tolerant ofmemory cell circuit defects such as broken wordlines, wordline towordline shorts and a weak control gate.

As a result, the memory system can handle memory cell defects and yieldsa more robust and more reliable memory system and ensure integrity ofthe data stored in the memory. The above systems and methods alsoprovide improved handling of fallouts and also provides the ability toreclaim memories and safely and accurately recover data stored in failedmemory cells.

FIG. 1 is a block diagram of a memory system 100, for implementingembodiments of the present disclosure. The memory system 100 is coupledto or included as part of a host computer system 120. The memory system100 includes one or more memory integrated circuits 101. The memoryintegrated circuits 101 include a memory array 102. The memory array 102includes many memory cells arranged in logical blocks 102A-n+m of memorycells (e.g., memory blocks 102A-n+m). The memory integrated circuits 101can also include buffer or latch circuits A-C, 101A-101C.

The memory system 100 also includes a memory controller 110. The memorycontroller 110 is coupled to one or more memory integrated circuits 101by a bus 111. The memory controller 110 can include circuits forcontrolling the memory system 100 such as a memory control unit, aprocessor, buffers, memory operating system logic and other memorycontroller logic. The memory operating system logic can alternatively oradditionally be included in the memory array 102.

The memory blocks 102A-n+m can include nonvolatile memory (NVM) such asflash memory (e.g., NAND flash) and other types of memory such as NOR,ORNAND, 3D memory and charge trapping memory with material such assilicon nitride, and phase change memory such as ReRAM, andsemi-volatile memory such as DRAM and SRAM. The memory blocks 102A-n+mcan also include volatile type memory circuits. The volatile type memorycircuits can include, as an example, dynamic random access memory (DRAM)and static random access (SRAM) memory circuits.

The reliability of the user data storage in the memory blocks can beimproved using the principles disclosed below for any type of memorythat includes a memory cell, an accessing wordline and an accessingbitline. While the following examples discuss using a wordlines, similarprocesses can be applied to bitlines instead of or in addition to thewordline processes.

FIG. 2 is a simplified block diagram of a memory block 102A, forimplementing embodiments of the present disclosure. Each of the memoryblocks 102A-n+m undergo write and erase cycles and read accesses. Thememory blocks 102A-n+m can suffer device failures as the cycling/accesscounts increase. Typical failure modes in a memory block can include butare not limited to:

A wordline to wordline short;

Broken (open) wordlines;

A wordline to wordline short 204 is shown in FIG. 2. The wordline towordline short 204 typically occurs due to an arc through an insulatorelectrically separating the wordlines WL4 and WL5 and thus allowing thetwo wordlines to have the same voltage during operations. This leads towriting the same data to the memory cells coupled to both wordlines WL4and WL5.

A break or open 206 in wordline WL9 is shown in FIG. 2. The open 206 inwordline WL9 is typically caused by a failure in the conductor thatforms wordline WL9 such as caused by heat, voltage or current stresses.The open 206 in the wordline WL9 prevents wordline WL9 from accessingthe memory cells further to the right of the open 206.

FIG. 3 is a flowchart diagram that illustrates the method operations 300performed in for preventing data loss due to memory cell failure in aselected memory block, for implementing embodiments of the presentdisclosure. In an operation 305, a command is received from the hostcomputer system 120 to write a quantity of user data (DATAn) to aselected memory block 102 n. In operation 310, if n is equal to 0 thenthe received write command is the first write command and no previoususer data was stored in the memory system 100. If n is equal to 0, themethod operations continue in an operation 330 as described below.

If n is not equal to 0 in operation 310, then the received write commandis not the first write command and there were at least one previousquantity of user data (DATAn−1) stored in the memory system 100. If n isnot equal to 0 in operation 310, then the method operations continue inan operation 315.

In operation 315, the previously stored quantity of user data, DATAn−1is read from the previous wordline(s), WLn−1, in the selected memoryblock 102 n and written into a first buffer 101A. In an operation 320,the DATAn−1 is read from the first buffer 101A and into a second buffer101B.

FIG. 4A illustrates the movement of DATAn−1 in operations 315 and 320,for implementing embodiments of the present disclosure. User dataDATAn−1 was previously stored in the memory block 102 n, before the userdata DATAn was received in operation 305. User data DATAn−1 waspreviously stored in a preceding location WLn−1 in the memory block 102n. User data DATAn−1 is read from the location WLn−1 in the memory block102 n and written into the first buffer 101A. The user data DATAn−1 isread from the first buffer 101A and written into the second buffer 101B.

While the examples described herein discuss single wordlines andquantities of data read from and written to single wordlines and thebuffers 101A-C, it should be understood that a quantity of data spanningmore than one wordline can be read from and written to the buffers101A-C in substantially similar fashion. A capacity of the buffer 101A-Cdetermines how much user data can be processed. A larger capacity buffer101A-C can process more user data than a smaller capacity buffer. Itshould also be understood that each of the buffers 101A-C can beincluded in a single buffer and that each of the buffers 101A-C caninclude more than one buffer.

In operation 330, the received data, DATAn is written into the firstbuffer 101A. In an operation 335, the DATAn is read from the firstbuffer 101A and written into a selected wordline WLn of the selectedmemory block 102 n.

FIG. 4B illustrates the movement of DATAn in operations 330 and 335, forimplementing embodiments of the present disclosure. The user data DATAnis initially written into the first buffer 101A when received in thememory system 100. The user data DATAn is then written into the selectedmemory location WLn of the selected memory block 102 n.

In an operation 340, the write operation is examined to determine if thewrite operation was successful. There are many ways well known in theart to determine if the write operation is successful. By way ofexample: a check bit/count can be used or all or a portion of DATAn′stored in WLn can be read and compared to the corresponding portion ofDATAn in the first buffer 101A. An unsuccessful write operation isindicated if the DATAn in the first buffer 101A was not accuratelywritten to the WLn of the selected memory block 102 n. Conversely, asuccessful write operation is indicated if the DATAn in the first buffer101A was accurately written to the WLn of the selected memory block 102n. If, in operation 340, the write operation was successful, the methodoperations continue in an operation 360 described below.

If, in operation 340, the write operation was not successful, the methodoperations continue in an operation 345. The DATAn in the first buffer101A is written to WLn of another selected memory block such as memoryblock 102 n+m in operation 345. The value m can be equal to some valuesufficient to indicate another memory block. It should also beunderstood that 102 n+m could also indicate a different portion of thesame memory block 102 n. By way of example, different wordlines otherthan WLn, WLn−1 within memory block 102 n can be indicated by memoryblock 102 n+m.

In one implementation, the another selected memory block 102 n+m can bea dedicated memory block referred to as a safe zone memory block and canbe set aside specifically for handling memory failures. The safe zonememory block and can be included in or separate from the memory array102. Alternatively, the another selected memory block 102 n+m can be anysuitable memory block 102 a-102 n−1 in the memory array 102. In oneimplementation, the write operation in operation 345 can be processedsimilar to operation 305 as described above, with regard to the newlyselected memory block 102 n+m. In an operation 350, the DATAn−1 iswritten from the second buffer 101B to WLn−1 of memory block 102 n+m.

FIG. 5A illustrates the movement of DATAn in operation 345 and DATAn−1in operation 350, for implementing embodiments of the presentdisclosure. The write operation in operation 335 was not successful andthe user data DATAn is still maintained in the first buffer 101A untilthe user data DATAn is successfully stored in the memory array 102.Previously stored user data DATAn−1, if present, was copied to thesecond buffer 101B, as described above in FIG. 4A, before the writeoperation in operation 335. Due to the failed write operation 335, thepreviously stored data DATAn−1 is presumed to be corrupted when theunsuccessful write operation of user data DATAn was attempted. The userdata DATAn−1 stored in the second buffer 101B is presumed to be anaccurate copy of the previously stored user data DATAn−1 that waswritten to location WLn−1 in the selected memory block 102 n as userdata DATAn−1 was copied into the second buffer 101B before the userDATAn was stored in the adjacent location WLn.

The unsuccessful write operation 335 of user data DATAn indicates adefect or failure in the memory block 102 n or at least the locationsWLn and WLn−1 of memory block 102 n. The unsuccessful write operation335 of user data DATAn initiates a first data recovery process. Thefirst data recovery process writes the user data DATAn in the firstbuffer 101A and the previously stored user data DATAn−1 in the secondbuffer 101B to respective memory locations WLn and WLn−1 in a new memoryblock 102 n+m.

In an optional operation 355, the write failure can be reported to thehost computer system 120 and/or the memory controller 110 and the methodoperations can end. The write failure indicates the memory block 102 nsuffering the write failure. The write failure can also indicate thewordlines WLn, WLn−1 associated with the detected failure. The memorycontroller 110 and/or host computer 120 can then identify the memoryblock 102 n and/or wordlines WLn, WLn−1 as being damaged and unusable.The entire memory block 102 n or at least the defective wordlines WLn,WLn−1 from the memory block can be prevented from being used in a futurewrite operation.

If, in operation 340, the write operation was successful, the methodoperations continue in operation 360 where DATAn is written from thefirst buffer 101A to a third buffer 101C. In an operation 365, theDATAn′ is read from WLn into the first buffer 101A.

FIG. 5B illustrates the movement of DATAn in operation 360 and DATAn′ inoperation 365, for implementing embodiments of the present disclosure.The successful write operation 335 indicates user data DATAn wasaccurate written to the WLn of memory block 102 n. The successful writeoperation 335 initiates a verification operation. The user data DATAn ismoved from the first buffer 101A and into the third buffer 101C. The WLnof memory block 102 n is read and data stored therein DATAn′ is writteninto the first buffer 101A so the DATAn and DATAn′ can be compared.

DATAn in the third buffer 101C is compared to DATAn′ in the first buffer101A to determine if a UECC error occurred, in an operation 370. Ifthere are no errors in an operation 375, the method operations can end.If there are errors in operation 375, the method operations continue inoperation 380.

In operation 380, the DATAn in the third buffer 101C is written to WLnof the other selected memory block such as memory block 102 n+m. In anoperation 385, the DATAn−1 is written from the second buffer 101B toWLn−1 of memory block 102 n+m.

FIG. 6 illustrates the movement of DATAn in operation 380 and DATAn−1 inoperation 385, for implementing embodiments of the present disclosure.The failed verification operation 370 and the user data DATAn is stillmaintained in the third buffer 101C until the user data DATAn issuccessfully stored in the memory array 102. Previously stored user dataDATAn−1, if present, was copied to the second buffer 101B, as describedabove in FIG. 4A, before the write operation in operation 335.

Due to the failed verification operation 370, the previously stored dataDATAn−1 is presumed to be corrupted when the unsuccessful writeoperation of user data DATAn was attempted. The user data DATAn−1 storedin the second buffer 101B is presumed to be an accurate copy of thepreviously stored user data DATAn−1 that was written to location WLn−1in the selected memory block 102 n as user data DATAn−1 was copied intothe second buffer 101B before the user DATAn was stored in the adjacentlocation WLn.

The failed verification operation 370 of user data DATAn indicates adefect or failure in the memory block 102 n or at least the locationsWLn and WLn−1 of memory block 102 n. The failed verification operation370 of user data DATAn initiates a second data recovery process. Thesecond data recovery process writes the user data DATAn in the thirdbuffer 101C and the previously stored user data DATAn−1 in the secondbuffer 101B to respective memory locations WLn and WLn−1 in a new memoryblock 102 n+m.

In an optional operation 390, the UECC error detected in operation 370can be reported to the host computer system 120 and/or the memorycontroller 110, similar to the optional operation 355 above. Similar tooperation 355, the host computer system 120 and/or the memory controller110 can use the reported UECC error detection to identify failed memoryblock 102 n and/or failed wordlines WLn, WLn−1. After operation 390, themethod operations can end.

FIG. 7 is a block diagram of an example computer system 1000, forimplementing embodiments of the present disclosure. A general orspecialized computer system, such as the computer system 1000, can beused as the host computer system 120 as described in FIG. 1, above andused for executing the operations for performing at least a portion ofthe analyses described above. The computer system 1000 includes acomputer 1002, a display 1018, an optional printer or output device (notshown), a removable media (e.g., magnetic/optical/flash) drive 1034, amass storage system 1014 (e.g., hard disk drive, solid state drive, orother suitable data storage device), a network interface 1030, and akeyboard 1022. Additional user interface devices such as a mouse 1024, atouch pad or touch screen can also be included.

The computer 1002 includes a central processing unit 1004, one or moredata buses 1010, random access memory (RAM) 1028, read only memory (ROM)1012, and an input/output interface 1020. The computer 1002 can be apersonal computer (such as an IBM compatible personal computer, aMacintosh computer or Macintosh compatible computer), a workstationcomputer (such as a Sun Microsystems or Hewlett-Packard workstation), orsome other suitable type of computer.

The CPU 1004 can be a general purpose digital processor or a speciallydesigned processor. The CPU 1004 controls the operation of the computersystem 1000. Using instructions retrieved from memory (e.g. program(s)1008), the CPU 1004 controls the reception and manipulation of inputdata and the output and display of data on output devices.

The data buses 1010 are used by the CPU 1004 to access the RAM 1028, theROM 1012 and the mass storage 1014. The RAM 1028 is used by the CPU 1004as a general storage area and as scratch-pad memory, and can also beused to store input data and processed data. The RAM 1028 and the ROM1012 can be used to store computer readable instructions or program code1008 readable and executable by the CPU 1004 as well as other data.

The bus 1010 can also be used to access the input, output, and storagedevices used by the computer 1002. These devices include the display1018, the optional printer (not shown), the removable media drive 1034,and the network interface 1030. The input/output interface 1020 is usedto receive input from keyboard 1022 and send decoded symbols for eachpressed key to CPU 1004 over the data bus 1010.

The display 1018 is an output device that displays images of dataprovided by the CPU 1004 via the bus 1010 or provided by othercomponents in the computer system 1000. The optional printer device,when operating as a printer, provides an image on a sheet of paper or asimilar surface. Other output devices such as a plotter, projector, etc.can be used in place of, or in addition to, the printer device.

The removable media drive 1034 and the mass storage 1014 can be used tostore various types of data. The removable media drive 1034 facilitatestransporting such data to other computer systems, and mass storage 1014permits fast access to large amounts of stored data. The mass storage1014 may be included within the computer system or may be external tothe computer system such as network attached storage or cloud storageaccessible over one or more networks (e.g., local area networks, widearea networks, wireless networks, Internet 1032) or combinations of suchstorage devices and locations.

The CPU 1004 together with an operating system operate to executecomputer readable code and logic and produce and use data. The computercode, logic and data may reside within the RAM 1028, the ROM 1012, orthe mass storage 1014 or other media storage devices and combinationsthereof. The computer code and data could also reside on a removableprogram medium and loaded or installed onto the computer system 1000when needed. Removable program media include, for example, DVD, CD-ROM,PC-CARD, floppy disk, flash memory, optical media and magnetic disk ortape.

The network interface 1030 is used to send and receive data over anetwork 1032 connected to other computer systems. An interface card orsimilar device and appropriate software implemented by the CPU 1004 canbe used to connect the computer system 1000 to an existing network andtransfer data according to standard protocols such as local areanetworks, wide area networks, wireless networks, Internet and any othersuitable networks and network protocols.

The keyboard 1022 is used by a user to input commands and otherinstructions to the computer system 1000. Other types of user inputdevices can also be used in conjunction with the present invention. Forexample, pointing devices such as a computer mouse, a track ball, astylus, touch pad, touch screen or a tablet can be used to manipulate apointer on a screen of a general-purpose computer.

It will be further appreciated that the instructions represented by theoperations in the above figures are not required to be performed in theorder illustrated, and that all the processing represented by theoperations may not be necessary to practice the invention. It shouldalso be appreciated that some operations may have sub-operations and inother instances, certain operations described herein may not be includedin the illustrated operations. Further, the processes described in anyof the above figures can also be implemented in software stored in anyone of or combinations of the RAM, the ROM, or the hard disk drive.

Semiconductor memory devices include volatile memory devices, such asdynamic random access memory (“DRAM”) or static random access memory(“SRAM”) devices, non-volatile memory devices, such as resistive randomaccess memory (“ReRAM”), electrically erasable programmable read onlymemory (“EEPROM”), flash memory (which can also be considered a subsetof EEPROM), ferroelectric random access memory (“FRAM”), andmagnetoresistive random access memory (“MRAM”), and other semiconductorelements capable of storing information. Furthermore, each type ofmemory device may have different configurations. For example, flashmemory devices may be configured in a NAND or a NOR configuration.

The memory devices can be formed from passive and/or active elements, inany combinations. By way of non-limiting example, passive semiconductormemory elements include ReRAM device elements, which in some embodimentsinclude a resistivity switching storage element, such as an anti-fuse,phase change material, etc., and optionally a steering element, such asa diode, etc. Further by way of non-limiting example, activesemiconductor memory elements include EEPROM and flash memory deviceelements, which in some embodiments include elements containing a chargestorage region, such as a floating gate, conductive nanoparticles or acharge storage dielectric material.

Multiple memory elements may be configured so that they are connected inseries or such that each element is individually accessible. By way ofnon-limiting example, NAND devices contain memory elements (e.g.,devices containing a charge storage region) connected in series. Forexample, a NAND memory array may be configured so that the array iscomposed of multiple strings of memory in which each string is composedof multiple memory elements sharing a single bit line and accessed as agroup. In contrast, memory elements may be configured so that eachelement is individually accessible, e.g., a NOR memory array. One ofskill in the art will recognize that the NAND and NOR memoryconfigurations are exemplary, and memory elements may be otherwiseconfigured.

The semiconductor memory elements of a single device, such as elementslocated within and/or over the same substrate or in a single die, may bedistributed in two or three dimensions, such as a two dimensional arraystructure or a three dimensional array structure.

In a two dimensional memory structure, the semiconductor memory elementsare arranged in a single plane or single memory device level. Typically,in a two dimensional memory structure, memory elements are located in aplane (e.g., in an x-z direction plane) which extends substantiallyparallel to a major surface of a substrate that supports the memoryelements. The substrate may be a wafer over which the layers of thememory elements are deposited and/or in which memory elements are formedor it may be a carrier substrate which is attached to the memoryelements after they are formed.

The memory elements may be arranged in the single memory device level inan ordered array, such as in a plurality of rows and/or columns.However, the memory elements may be arranged in non-regular ornon-orthogonal configurations as understood by one of skill in the art.The memory elements may each have two or more electrodes or contactlines, such as bit lines and word lines.

A three dimensional memory array is organized so that memory elementsoccupy multiple planes or multiple device levels, forming a structure inthree dimensions (i.e., in the x, y and z directions, where the ydirection is substantially perpendicular and the x and z directions aresubstantially parallel to the major surface of the substrate).

As a non-limiting example, each plane in a three dimensional memoryarray structure may be physically located in two dimensions (one memorylevel) with multiple two dimensional memory levels to form a threedimensional memory array structure. As another non-limiting example, athree dimensional memory array may be physically structured as multiplevertical columns (e.g., columns extending substantially perpendicular tothe major surface of the substrate in the y direction) having multipleelements in each column and therefore having elements spanning severalvertically stacked memory planes. The columns may be arranged in a twodimensional configuration, e.g., in an x-z plane, thereby resulting in athree dimensional arrangement of memory elements. One of skill in theart will understand that other configurations of memory elements inthree dimensions will also constitute a three dimensional memory array.

By way of non-limiting example, in a three dimensional NAND memoryarray, the memory elements may be connected together to form a NANDstring within a single horizontal (e.g., x-z) plane. Alternatively, thememory elements may be connected together to extend through multiplehorizontal planes. Other three dimensional configurations can beenvisioned wherein some NAND strings contain memory elements in a singlememory level while other strings contain memory elements which extendthrough multiple memory levels. Three dimensional memory arrays may alsobe designed in a NOR configuration and in a ReRAM configuration.

A monolithic three dimensional memory array is one in which multiplememory levels are formed above and/or within a single substrate, such asa semiconductor wafer. In a monolithic three dimensional array thelayers of each level of the array are formed on the layers of eachunderlying level of the array. One of skill in the art will understandthat layers of adjacent levels of a monolithic three dimensional memoryarray may be shared or have intervening layers between memory levels. Incontrast, two dimensional arrays may be formed separately and thenpackaged together to form a non-monolithic memory device. For example,non-monolithic stacked memories have been constructed by forming memorylevels on separate substrates and adhering the memory levels atop eachother. The substrates may be thinned or removed from the memory levelsbefore bonding, but as the memory levels are initially formed overseparate substrates, such memories are not monolithic three dimensionalmemory arrays. Further, multiple two dimensional memory arrays or threedimensional memory arrays (monolithic or non-monolithic) may be formedseparately and then packaged together to form a stacked-chip memorydevice.

One of skill in the art will recognize that this invention is notlimited to the two dimensional and three dimensional exemplarystructures described but cover all relevant memory structures within thespirit and scope of the invention as described herein and as understoodby one of skill in the art.

With the above embodiments in mind, it should be understood that theinvention may employ various computer-implemented operations involvingdata stored in computer systems. These operations are those requiringphysical manipulation of physical quantities. Usually, though notnecessarily, these quantities take the form of electrical or magneticsignals capable of being stored, transferred, combined, compared, andotherwise manipulated. Further, the manipulations performed are oftenreferred to in terms, such as producing, identifying, determining, orcomparing.

The invention may be practiced with other computer system configurationsincluding hand-held devices, microprocessor systems,microprocessor-based or programmable consumer electronics,minicomputers, mainframe computers and the like. The invention may alsobe practiced in distributing computing environments where tasks areperformed by remote processing devices that are linked through anetwork.

With the above embodiments in mind, it should be understood that theinvention may employ various computer-implemented operations involvingdata stored in computer systems. These operations are those requiringphysical manipulation of physical quantities. Usually, though notnecessarily, these quantities take the form of electrical or magneticsignals capable of being stored, transferred, combined, compared, andotherwise manipulated. Further, the manipulations performed are oftenreferred to in terms, such as producing, identifying, determining, orcomparing.

It will be further appreciated that the instructions represented by theoperations in the above figures are not required to be performed in theorder illustrated, and that all the processing represented by theoperations may not be necessary to practice the invention. Further, theprocesses described in any of the above figures can also be implementedin software stored in any one of or combinations of the RAM, the ROM, orthe hard disk drive.

Although the foregoing invention has been described in some detail forpurposes of clarity of understanding, it will be apparent that certainchanges and modifications may be practiced within the scope of theappended claims. Accordingly, the present embodiments are to beconsidered as illustrative and not restrictive, and the invention is notto be limited to the details given herein, but may be modified withinthe scope and equivalents of the appended claims.

What is claimed is:
 1. A memory system including: a memory operatingsystem logic including: computer readable instructions on computerreadable media for receiving a first quantity of user data to be storedin the memory system; computer readable instructions on computerreadable media for writing the first quantity of user data to a firstlocation of the memory system; computer readable instructions oncomputer readable media for writing the first quantity of user data fromthe first location to a first selected memory location within the memorysystem; computer readable instructions on computer readable media forwriting the first quantity of user data from the first location into asecond location of the memory system when the first quantity of userdata was successfully written to the first selected memory location; andcomputer readable instructions on computer readable media for confirminga quantity of stored data in the first location matches the firstquantity of user data in the second location.
 2. The system of claim 1,wherein the computer readable instructions on computer readable mediafor confirming the quantity of stored data in the first location matchesthe first quantity of user data in the second location, includescomputer readable instructions on computer readable media for retrievingthe quantity of stored data in the first selected memory location andwriting the retrieved quantity of stored data into the first location.3. The system of claim 1, further comprising: computer readableinstructions on computer readable media for writing the first quantityof user data from the first location into a second selected memorylocation in the memory system when the first quantity of user data wasnot successfully written to the first selected memory location; andcomputer readable instructions on computer readable media for writing asecond quantity of user data to a third selected memory location in thememory system when the second quantity of user data is stored in a thirdlocation of the memory system.
 4. The system of claim 3, furthercomprising a dedicated error handling memory block in the memory systemand wherein the second selected memory location is included in thededicated error handling memory block.
 5. The system of claim 1, furthercomprising a dedicated error handling memory block in the memory system.6. The system of claim 1, further comprising a memory controller,wherein at least a portion of the memory operating system logic isincluded in the memory controller.
 7. The system of claim 1, furthercomprising a memory array including a plurality of memory blocks ofmemory storage locations and at least a portion of the memory operatingsystem logic is included in the memory array.
 8. The system of claim 1,further comprising a memory array including a plurality of memory blocksof memory storage locations and at least a portion of a memorycontroller is included in the memory array.
 9. The system of claim 1,further comprising a memory array including a plurality of memory blocksof memory storage locations and wherein the memory array includes adedicated error handling memory block.
 10. The system of claim 1,wherein the memory system is included in a computer system, the computersystem including a processor coupled to the memory system by a data bus.11. The system of claim 1, wherein at least a portion of the computerreadable instructions on computer readable media are included in atleast one integrated circuit.
 12. A data storage system comprising: aplurality of computer systems coupled by a data network, each one of theplurality of computer systems including: a memory system coupled to aprocessor by a data bus; and a memory operating system logic including:computer readable instructions on computer readable media for receivinga first quantity of user data to be stored in the memory system;computer readable instructions on computer readable media for writingthe first quantity of user data to a first location of the memorysystem; computer readable instructions on computer readable media forwriting the first quantity of user data from the first location to afirst selected memory location within the memory system; computerreadable instructions on computer readable media for writing the firstquantity of user data from the first location into a second location ofthe memory system when the first quantity of user data was successfullywritten to the first selected memory location; and computer readableinstructions on computer readable media for confirming a quantity ofstored data in the first location matches the first quantity of userdata in the second location.
 13. The system of claim 12, wherein thememory system includes a plurality of memory storage locations and thememory operating system logic is for operating the plurality of memorystorage locations.
 14. The system of claim 12, wherein the computerreadable instructions on computer readable media for confirming thequantity of stored data in the first location matches the first quantityof user data in the second location, includes computer readableinstructions on computer readable media for retrieving the quantity ofstored data in the first selected memory location and writing theretrieved quantity of stored data into the first location.
 15. Thesystem of claim 12, further comprising: computer readable instructionson computer readable media for writing the first quantity of user datafrom the first location into a second selected memory location in thememory system when the first quantity of user data was not successfullywritten to the first selected memory location; and computer readableinstructions on computer readable media for writing a second quantity ofuser data to a third selected memory location in the memory system whenthe second quantity of user data is stored in a third location of thememory system.
 16. A mass data storage system comprising: a plurality ofmemory storage locations; and a memory operating system logic forstoring data in the plurality of memory storage locations, the memoryoperating system logic including: computer readable instructions oncomputer readable media for receiving a first quantity of user data tobe stored in the memory system; computer readable instructions oncomputer readable media for writing the first quantity of user data to afirst location of the memory system; computer readable instructions oncomputer readable media for writing the first quantity of user data fromthe first location to a first selected memory location within the memorysystem; computer readable instructions on computer readable media forwriting the first quantity of user data from the first location into asecond location of the memory system when the first quantity of userdata was successfully written to the first selected memory location; andcomputer readable instructions on computer readable media for confirminga quantity of stored data in the first location matches the firstquantity of user data in the second location.
 17. The system of claim16, wherein the mass data storage system is external from a computersystem and coupled to the computer system via a network interface and adata network.
 18. The system of claim 16, wherein the mass data storagesystem is external from a plurality of computer systems and coupled toeach one of the plurality of computer systems via one or more datanetworks.
 19. A system for managing a computer memory system comprising:a memory operating system logic for writing to and reading data from aplurality of memory storage locations in the computer memory system,wherein at least a portion of the memory operating system logic isimplemented in an integrated circuit included in a portion of a memorycontroller for the computer memory system, the memory operating systemlogic including: computer readable instructions on computer readablemedia for receiving a first quantity of user data to be stored in thememory system; computer readable instructions on computer readable mediafor writing the first quantity of user data to a first location of thememory system; computer readable instructions on computer readable mediafor writing the first quantity of user data from the first location to afirst selected memory location within the memory system; computerreadable instructions on computer readable media for identifying afailure in the first selected memory location; and computer readableinstructions on computer readable media for identifying the firstselected memory location as being unusable.
 20. The system of claim 19,wherein the computer readable instructions on computer readable mediafor identifying the failure in the first selected memory locationincludes computer readable instructions on computer readable media fordetermining the first quantity of user data was not successfully writtento the first selected memory location.
 21. The system of claim 20,further comprising computer readable instructions on computer readablemedia for writing the first quantity of user data from the firstlocation into a second selected memory location in the memory systemwhen the first quantity of user data was not successfully written to thefirst selected memory location.
 22. The system of claim 21 furthercomprising computer readable instructions on computer readable media forwriting a second quantity of user data to a third selected memorylocation in the memory system when the second quantity of user data isstored in a third location of the memory system.
 23. The system of claim21, wherein the computer readable instructions on computer readablemedia for identifying the failure in the first selected memory locationincludes computer readable instructions on computer readable media forconfirming a quantity of stored data in the first location matches thefirst quantity of user data in the second location.